ABSTRACT

The Educational Information Network (EIN) enables 
member colleges, universities, and educational service organizations 
to share their computational resources among interested constituents. 
To accomplish this objective, the EIN Software Catalog is published, 
describing the available resources. The purpose of this handbook is 
to define the documentation for programs and systems of programs 
available through EIN. By following the guidelines in the handbook, 
participating institutions facilitate the listing of their programs 
in the Catalog. (JY)

DOCUMENTATION STANDARDS HANDBOOK FOR THE EIN SOFTWARE CATALOG



Introduction 

The Educational Information Network (EIN) is an EDUCOM project,
jointly sponsored by the National Science Foundation (NSF) and
the U.S. Office of Education. The major objective of the project
is to enable member colleges, universities, and educational
service organizations to share their computational resources
among interested constituents. To accomplish this objective,
the EIN Software Catalog is published, describing the available
resources.

The catalog listings provide the information necessary to permit
a user to identify and locate those computational resources
of potential utility. After selecting the resource that best
answers a particular need, the user will then prepare data
according to the required format and send the input to the
selected resource center for processing. A record-keeping system
will assure the resource center of reciprocal credit or payment
for its services and will, in addition, monitor the progress
of resource utilization throughout the network.

To expedite the technical aspects of this process, it is
necessary for EDUCOM to receive properly prepared documentation
from participating institutions through their designated
technical representatives. The purpose of this handbook is to
define the documentation for programs and systems of programs
available through EIN.

The EIN staff recognizes that present documentation at
participating institutions does not exist in any common format, and
that, in many instances, the documentation submitted will not
easily meet the format specified in this handbook. However,
it is imperative that all information herein requested be
included, regardless of the format chosen.

Documentation Criteria

Documentation that meets the EDUCOM standards must include the
amount of information necessary to inform a prospective user
of the precise problem that the program was designed to solve,
and to enable him to prepare the required input for the desired
results. For extended systems of programs, the catalog entry
should identify the capabilities of the system so that the
reader can judge the value of studying the references in more
detail.

The central item in the catalog entry is a 300-word program
abstract that summarizes the functional features of a program and
clearly states those processing options which make it useful.
While abstracting is known to be somewhat of an art, we believe
that careful attention to detail in the content of the abstract
is more important than strict attention to form.

Each catalog entry will be carefully checked for the inclusion
of the information items specified in this handbook. Those
deficient in terms of content will be returned to the submitter
so that he can supply the missing items. In addition, all
entries will be returned to the resource institution for a final
examination before printing.

Constructive suggestions for improvement of the format and/or 
the definitions are welcome. Any suggested changes will be 
placed before the EIN Executive Committee. 

Model Catalog Entry 

A model catalog entry for a program submitted by the University
of Iowa has been reproduced in the following pages. The entry
is representative of the type that we hope to receive from
submitters.

EIN DOCUMENTATION STANDARDS FOR PROGRAMS



Field                   Definition

DESCRIPTIVE TITLE       This title is the name that describes
                        what the program does.

CALLING NAME            This item provides the calling name or
                        loading name of the program, including
                        any symbols needed for identification
                        in the computer center's library.

INSTALLATION NAME       Enter the name of the university computer
                        center at which the program is presently
                        operating and its location (if different
                        from the university).

AUTHOR(S) AND           The author of the program is usually
AFFILIATION(S)          the person who did the actual programming
                        and design work. If these tasks were
                        separate, please list both parties and
                        their present affiliations (specific
                        office or particular laboratory). An
                        author may request that his name not
                        appear in the published catalog.

LANGUAGE                The programming language in which the
                        program is written.

COMPUTER                The computer on which the program
                        executes.

PROGRAM AVAILABILITY    Specify whether program decks and/or
                        listings are available on user request.

CONTACT                 Person(s) to contact at the institution
                        for further information.



FUNCTIONAL ABSTRACT 

This information is central to the catalog entry. It should 
contain the following. 

a. Description of the Problem 

The problem that the program is designed to solve should be
described in such a way as to help the reader to identify elements
that may be analogous to his own problem. While this description
must be brief, it should clearly identify the conceptual
limitations of the program.

b. Method of Solution

When the method of solution is well known or documented in
standard publications, it should be identified by references.
Modifications to well-known methods, new methods, or novel
combinations of methods should be fully described to indicate their
applicability.

c. Special Features of the Program

Processing features and options that contribute to the
uniqueness of the program should be summarized. Types of input and
output should be discussed in terms of their potential value
for solution of problems.

d. References 

Books, periodicals, and other supporting literature related to 
the program and its methodology should be listed. 

USER INSTRUCTIONS 

This item will be the basis for data preparation. Hence,
instructions regarding data requirements should be clearly and
explicitly stated as follows.

a. Input Preparation Formats and Options 

These instructions provide the user with the information
necessary to prepare his data for input to the program in terms of:

(1) the precise definition of all variables;

(2) the exact format and arrangement of input parameters; 

(3) the required card or tape format for all input data; 

(4) the sequence of control statements, if required. 

b. Output Formats and Options 

These instructions will clearly explain all output variables 
and the choices of formats available for selection. Some note 
regarding the accuracy of results also should be included. 

c. Data Restrictions

The user must be provided with a full explanation of any data 
restrictions such as those constituting illegal input, numerical 
or data-set limitations, and the number or size of the data sets 
that can be handled by the program. 

d. Procedural References 

Manuals, detailed documentation, etc. required to use the
program are listed.

SAMPLE INPUT/OUTPUT MODELS 

This item will help the user to understand all of the
aforementioned information quickly and easily. The sample test case
demonstrates the main features of the program through graphic
illustrations (printouts) of real results.

COST ESTIMATE 

A statement of the number of computational units used in
processing the sample problem should be provided to give the user
an idea of the potential time and costs for using the program.
Actual or approximate dollar value should be stated where
possible.



EIN DOCUMENTATION STANDARDS FOR SYSTEMS SOFTWARE 



The description of systems follows the guidelines for Programs. 
However, the documentation should reference operational details 
rather than describe them in full. The fields should include 
the following information. 



Field                   Definition

DESCRIPTIVE TITLE       As for Programs, including the
                        specification of the type of software
                        described. For example, Remote
                        Interactive System or Operating System.

CALLING NAME            As for Programs.

INSTALLATION NAME       As for Programs.

AUTHOR(S) AND           As for Programs; this may include the
AFFILIATION(S)          source of the system.

LANGUAGE                As for Programs, if applicable.

COMPUTER                As for Programs.

PROGRAM AVAILABILITY    As for Programs. Specify whether or
                        not the availability is without charge.

CONTACT                 As for Programs. This person will be
                        used more heavily than for user programs.






FUNCTIONAL ABSTRACT 

The functional abstract should include a brief summary of the
software: what it does, how it does it, and any other
information a person may require to determine whether it can be
helpful to his needs. The purpose of the abstract is to inform of
the uses to which the system is suited.

USER INSTRUCTIONS 

This item will be the basis for a user's decision as to whether
the constraints of the system are acceptable to his purpose.
It should include the data restrictions, as well as the
hardware, software, and interface constraints, if applicable. Other
special requirements should be mentioned.

References should be cited, including manuals, articles, and
all other material required for user understanding of the
system. Location and prices of this material should be given.

EIN will maintain a copy of each item in the central office as 
a backup reference. 

SAMPLE INPUT/OUTPUT MODELS 

Exclude except for writeups of specific programs, where desired.

COST ESTIMATE

The pricing algorithm for use of the system should be given. 
Examples, where applicable, are to be included. 



EIN DOCUMENTATION STANDARDS FOR COMPUTER FACILITIES 

The description of facilities should follow the guidelines for
Programs. However, rather than including full operational
details, the documentation should reference them. The fields
should include the following information.



Field                   Definition

DESCRIPTIVE TITLE       As for Programs.

INSTALLATION NAME       As for Programs.

LANGUAGE                As for Programs.

COMPUTER                As for Programs.

AVAILABILITY            Include access information.

CONTACT                 As for Systems Software.

FUNCTIONAL ABSTRACT 

The purpose of the functional abstract is to describe the
potential utility of the facility in broad terms. The abstract
should include a description of the hardware and special
software, the executive system and special peripheral equipment
available. Access information should also be given, including
type of access and any special constraints (type of terminals
needed for remote entry, length or width of tapes, etc.).

USER INSTRUCTIONS 

As for Systems Software, include a preliminary bibliography 
of system references and any special local references. The 
times the system is available should be listed. The services 
provided by the installation should be described (consulting, 
programming, debugging, keypunching, etc.), as well as any 
special procedures or options that should be noted. 

SAMPLE INPUT/OUTPUT MODELS 

Exclude except for writeups of specific programs, if desired.

COST ESTIMATE

This should include the rates for consulting, postage and
handling, and other services, as well as actual computer
charges. Also, indicate any differential between educational
and noneducational rates.

DESCRIPTIVE TITLE       General Multiple Regression Analysis



CALLING NAME            REGAN1

INSTALLATION NAME       The University of Iowa
                        University Computer Center

AUTHOR(S) AND           Stan Walljasper, IBM 7044 version
AFFILIATION(S)          Converted to IBM 360 by Louise R. Levine
                        The University of Iowa Computer Center

LANGUAGE                FORTRAN IV (G)

COMPUTER                IBM 360/65

PROGRAM AVAILABILITY    Decks and listings presently available

CONTACT                 Mrs. Louise R. Levine, Program Librarian,
                        Applications Programming, University
                        Computer Center, The Univ. of Iowa,
                        Iowa City, Iowa 52240
                        Tel.: (319) 353-5580

FUNCTIONAL ABSTRACT 

REGAN1 computes the Pearson Product Moment correlation matrix,
mean, standard deviation, and population standard deviation. If
desired, the multiple R, standard error of the estimates, F
ratio, degrees of freedom, regression coefficients, partial
coefficients and F ratio associated with each individual variate,
the intercept constant, and residuals also may be calculated.

There is no allowance for missing data. The basic equations
and a simple explanation may be found in Cooley and Lohnes (Ref. 1).
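
The distinction between the two standard deviations named in the abstract can be illustrated with a short Python sketch. It assumes that "standard deviation" means the sample form (divisor n - 1) and "population standard deviation" the form with divisor n; the abstract does not state this, and the data values and the helper name pearson_r are illustrative only, not part of REGAN1.

```python
import math
import statistics

def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length lists."""
    mx, my = statistics.mean(x), statistics.mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Illustrative values only (not taken from the sample deck).
x = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
y = [1.0, 3.0, 2.0, 5.0, 4.0, 6.0, 8.0, 9.0]

mean_x = statistics.mean(x)
std_dev_x = statistics.stdev(x)       # assumed "STD. DEV.": divisor n - 1
pop_std_dev_x = statistics.pstdev(x)  # assumed "POP. S.D.": divisor n
r_xy = pearson_r(x, y)
```

Under this reading the two deviations differ by a factor of sqrt(n / (n - 1)), which shrinks toward 1 as the number of observations grows.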

REFERENCES 

1. Cooley, W.W., and Lohnes, P.R., Multivariate Procedures for
the Behavioral Sciences (John Wiley & Sons, Inc., New York,
1962), pp. 31-45.

USER INSTRUCTIONS



Input Deck 

The input deck consists of the following cards in the order 
described below. 



Title Card 

This card gives any alphanumeric information to be printed as 
a heading on the output. 

Problem Card 



Columns     Contents

 4- 5       Number of variables (independent + dependent <= 50)

 6-10       Total number of observations

11-15       1: No data error check is desired
            2: Data error check is desired

16-20       1: No residuals are to be calculated
            2: Residuals are to be calculated

21-30       If data error check is requested, punch the lower
            bound for allowable data. The assumed format is
            F10.5. Leave blank if no data error check is
            requested.

31-40       If data error check is requested, punch the upper
            bound for allowable data. The assumed format is
            F10.5.

41          Number of Format Cards (<= 3)

42          blank: card input
            2, 3, 4, 8, or 9: unit number of disc or tape input
            Appropriate DD cards for this file should be placed
            before the //GO.SYSIN card.
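
As an illustration of the Problem Card layout, the following Python sketch builds an 80-column card image. It is a minimal sketch, not part of REGAN1: the function name problem_card and its parameters are hypothetical, and it assumes that integer fields are right-justified within their columns, as is usual for FORTRAN integer input.

```python
def problem_card(n_vars, n_obs, residuals=False, bounds=None,
                 n_format_cards=1, input_unit=None):
    """Return an 80-column Problem Card image (hypothetical helper).

    bounds is None for no data error check, or a (lower, upper) pair.
    input_unit is None for card input, or a disc/tape unit number.
    """
    if not 1 <= n_vars <= 50:
        raise ValueError("number of variables must be between 1 and 50")
    if not 1 <= n_format_cards <= 3:
        raise ValueError("number of Format Cards must be between 1 and 3")
    if input_unit not in (None, 2, 3, 4, 8, 9):
        raise ValueError("input unit must be 2, 3, 4, 8, or 9")

    check_code = 2 if bounds is not None else 1
    residual_code = 2 if residuals else 1

    card = " " * 3                    # columns 1-3: not used
    card += f"{n_vars:2d}"            # columns 4-5 (right-justified: assumption)
    card += f"{n_obs:5d}"             # columns 6-10
    card += f"{check_code:5d}"        # columns 11-15
    card += f"{residual_code:5d}"     # columns 16-20
    if bounds is None:
        card += " " * 20              # columns 21-40 left blank
    else:
        lower, upper = bounds
        card += f"{lower:10.5f}{upper:10.5f}"   # columns 21-30, 31-40 (F10.5)
    card += str(n_format_cards)       # column 41
    card += " " if input_unit is None else str(input_unit)  # column 42
    return card.ljust(80)

# Four variables, 21 observations, residuals requested, one Format Card.
card = problem_card(4, 21, residuals=True)
checked = problem_card(4, 21, bounds=(0.0, 200.0))
```

Building the card from fixed-width pieces keeps each field tied to the column range given in the table, so a misplaced value shows up as a length error rather than a silently shifted field.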



Format Cards

Columns     Contents

1-80        The data should be read by subject, for all variables
            with an E or F type FORTRAN format. A maximum of
            3 Format Cards is allowed.

Data Cards 

The Data Cards contain the records of observations. Any uniform
number of cards per observation may be used. Each observation
must start on a new card.
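
The way an observation is picked up from its Data Cards can be sketched in Python as follows. This is an illustration only, under stated assumptions: every variable occupies one fixed-width field (as with an F10.0 descriptor), each field carries an explicit decimal point, and a blank field is read as zero. The function name read_observation and the card contents are hypothetical.

```python
def read_observation(cards, n_vars, width=10):
    """Read n_vars fixed-width values from the card images of one observation."""
    per_card = 80 // width
    values = []
    for card in cards:
        card = card.ljust(80)
        for i in range(per_card):
            if len(values) == n_vars:
                break
            field = card[i * width:(i + 1) * width].strip()
            # A blank field is treated as zero (assumption).
            values.append(float(field) if field else 0.0)
    if len(values) < n_vars:
        raise ValueError("not enough fields for one observation")
    return values

# One card holding four illustrative values in 10-column fields.
card = "".join(f"{text:>10}" for text in ("77.6", "2.61", "34.0", "3.8"))
observation = read_observation([card], 4)
```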

/* Card

//GO.FT05F00n DD * Card

This card must be present for each problem, with n taking the 
value of 2 for the first problem and increasing by one for each 
additional problem. 

Permutation Cards

Columns     Contents

5           1: No residuals are desired
            2: Residuals are desired

6-10        Number of variables to be used for the multiple R
            (independent + dependent variables)

11-12       Dependent variable number

13-14       Independent variables by number. If there are more
  .         than 34 independent variables, continue on another
  .         card beginning in Cols. 1-2.
  .
79-80

Only one problem can be run with as many Permutation Cards
(max. 20) as necessary.
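
The Permutation Card layout, including the continuation rule for more than 34 independent variables, can be sketched in Python as shown here. The function name permutation_cards is hypothetical, and the sketch assumes that numbers are right-justified in their fields and that a continuation card carries only variable numbers in two-column fields starting at Cols. 1-2.

```python
def permutation_cards(dependent, independents, residuals=False):
    """Return the 80-column card images for one permutation (hypothetical helper)."""
    n_used = len(independents) + 1                   # independent + dependent
    current = " " * 4                                # columns 1-4: not used
    current += "2" if residuals else "1"             # column 5
    current += f"{n_used:5d}"                        # columns 6-10
    current += f"{dependent:2d}"                     # columns 11-12
    cards = []
    for number in independents:                      # columns 13-14, 15-16, ...
        if len(current) + 2 > 80:                    # card full: continue on a new card
            cards.append(current)
            current = ""
        current += f"{number:2d}"
    cards.append(current)
    return [image.ljust(80) for image in cards]

# Dependent variable 4 regressed on variables 1, 2, and 3, with residuals.
cards = permutation_cards(4, [1, 2, 3], residuals=True)
long_run = permutation_cards(36, list(range(1, 36)))
```

The first card has room for exactly 34 two-column variable numbers after column 12, which is where the limit of 34 in the table comes from.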

Program Deck

//name JOB (project no., etc.)
//step EXEC STAT,PARM.GO='REGAN1'
//GO.SYSIN DD *
Title Card
Problem Card
Format Card
Data Cards
/*
//GO.FT05F002 DD *
Permutation Cards
/*

Method 

The data are read and the summation on matrices are updated using
an auxiliary index matrix (Ref. 1). Means, deviations, and
product-moment correlations are computed by a triangular method
(Ref. 2). The simple correlation matrix is transferred to a work
area from which the multiple R, Beta weights, etc., are calculated.
The basic equations and a simple explanation may be found in
Cooley and Lohnes (Ref. 3).
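
The step from the simple correlation matrix to the Beta weights and the multiple R can be sketched in Python using the textbook relations: the Beta weights solve R_xx * beta = r_xy, and the squared multiple R is the sum of beta * r_xy. This is a plain Gaussian-elimination illustration, not the triangular method cited above, and the function name solve is hypothetical. The correlation values are copied from the scanned Sample Output and may not be exact in every digit.

```python
import math

def solve(a, b):
    """Solve a * x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]   # augmented copy
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):                    # back substitution
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x

# Correlations among the independent variables 1, 2, 3 (as scanned).
r_xx = [[1.0, 0.919635, 0.915466],
        [0.919635, 1.0, 0.901040],
        [0.915466, 0.901040, 1.0]]
# Correlations of variables 1, 2, 3 with the dependent variable 4.
r_xy = [0.342833, 0.355016, 0.164767]

beta = solve(r_xx, r_xy)                              # standardized weights
multiple_r = math.sqrt(sum(b * r for b, r in zip(beta, r_xy)))
```

Because the three independent variables are highly intercorrelated, the computed weights are sensitive to small changes in the input correlations.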

REGAN1 runs on a 360/65 under HASP and OS with FORTRAN IV (G) 
and uses 70K bytes of core during execution with the source 
deck. A work area for data set reference number 1 is required. 

REFERENCES 

1. Caffrey, J., "CRAM, Algorithm 67", Collected Algorithms from
CACM (A.C.M., New York).

2. Hafley, W.L., and Lewis, J.S., "Triangular Regression,
Algorithm 142", Collected Algorithms from CACM (A.C.M.,
New York).

3. Cooley, W.W., and Lohnes, P.R., Multivariate Procedures for
the Behavioral Sciences (John Wiley & Sons, Inc., New York,
1962).

SAMPLE INPUT



TESTING REGAN1
    4   21    1    2                    1
(4F10.0)
      66.0      2.53      30.6       3.6
      77.6      2.61      34.0       3.8
      83.8      2.62      46.0       3.6
      83.0      2.66      35.7       5.9
      63.6      2.62      54.1       5.3
      90.5      2.86      59.3       3.3
      92.5      2.96      51.9       3.0
      93.2      3.20      52.6       2.9
      93.6      2.90      51.7       5.5
      93.3      3.06      67.4       4.4
      94.7      3.36      70.0       4.1
      se.c      3.69      67.6       4.3
     100.7      3.79      60.9       6.6
     101.5      4.38      75.3       5.5
     103.1      4.41      74.8       5.5
     104.2      4.35      71.7       6.7
     ir5.4      4.33      63.0       5.5
     106.7      4.26      67.1       5.7
     106.1      4.40      94.0       5.2
     109.9      4.49     107.4       4.5
     113.1      5.13     116.0       3.6
/*
//GO.FT05F002 DD *
    2    4 4 1 2 3
/*
//

SAMPLE OUTPUT

TESTING REGAN1

ECHO CHECK

NO. OF VARIABLES    =    4
NO. OF OBSERVATIONS =   21
MS1                 =    1
MS2                 =    2
ER1                 =  0.0
ER2                 =  0.0
(4F10.0)
    68.000     2.530    30.600     3. 00

ACTUAL NOB = 21

VAR        MEAN           STD. DEV.        POP. S.D.
 1     0.95A713E 02     0.115109E 02     0.112334E 02
 2     0.357L90E 01     0.012613E 01     0.793029E 00
 3     0.663A75E 02     0.229507E 02     0.224054E 02
 4     0.472857E 01     0.113981E 01     0.111234E 01

CORR. MATRIX

           R(I, 1)    R(I, 2)    R(I, 3)    R(I, 4)    R(I, 5)    R(I, 6)    R(I, 7)
R( 2,J)   0.919635
R( 3,J)   0.915466   0.901040
R( 4,J)   0.342833   0.355016   0.164767

DEPENDENT VARIABLE IS NUMBER 4, INDEPENDENT VARIABLES ARE 1, 2, 3,

    MLT. R.            STD. ERROR
 0.56936973E 00      0.10163383E 01

       B                  BETA
 0.757350S2E-01      0.76484263E 00
 0.10165939E 01      0.72476649E 00
 0.59031434E-01     -0.11890440E 01

       F                   DF
 0.271023G2E 01      0.17000000E 02

       F
 0.16057405E 01
 0.17452154E 01
 0.49302360E 01

INTERCEPT -0.22165442E 01

TABLE OF RESIDUALS

          V           V
      ESTIMATE      VALUE      RESIDUAL    RESIDUAL
                                              STE
 1     3.69906     3.00000     0.10094     0.09931
 2     4.32169     3.00000    -0.52109    -0.51350
 3     4.28140     3.00000    -0.40140    -0.47367
 4     4.66619     5.90000     1.23301     1.21398
 5     3.59993     5.30000     1.70007     1.67274
 6     4.04438     3.30000    -0.74430    -0.73241
 7     4.73434     3.00000    -1.73434    -1.70646
 8     4.99001     2.90000    -2.09001    -2.05641
 9     4.76846     5.300*0     0.73154     0.71978
10     3.9R160     4.40000     0.41040     0.41160
11     4.23912     4.10000    -0.13912    -0.13689
12     5.15771     4.30000    -0.05771    -0.04393
13     5.66786     6.00000     1.13214     1.11394
14     5.47818     5.50000     0.02102     0.02147
15     5.65937     5.50000    -0.15937    -0.15601
16     5.86468     6.70000     0.03532     0.02109
17     5.26810     5.50000     0.23102     0.22010
18     5.05344     5.70000     0.64656     0.63616
19     4.09440     5.20000     0.3055?     0.30061
20     4.33127     4.50000     0.16073     0.16602
21     4.59051     3.00000    -0.79051    -0.70567

 0.17550023E 02



EDUCOM EDUCATIONAL INFORMATION NETWORK     8/70     000 0107



COST ESTIMATE 

The costing algorithm is based on the amount of computing time, 
operator time, core storage, cards in and out, and lines and 
pages printed. The total cost for the Sample Output above 
(before a slight change in the pricing schedule) was $0.56. 

Charge to user = computer costs + postage + network overhead 

= $0.56 + postage + network overhead 
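
The charge formula can be written out as a short Python sketch. The computer cost is the $0.56 quoted above; the postage and network overhead amounts are hypothetical placeholders, since the entry does not state them, and the function name charge_to_user is illustrative.

```python
from decimal import Decimal

def charge_to_user(computer_costs, postage, network_overhead):
    """Charge to user = computer costs + postage + network overhead."""
    return computer_costs + postage + network_overhead

# $0.56 is the computer cost of the sample run; the other two
# amounts are hypothetical placeholders, not catalog figures.
total = charge_to_user(Decimal("0.56"), Decimal("0.06"), Decimal("0.10"))
```

Decimal is used rather than binary floating point so that cent amounts add exactly.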



CONTENTS: REGAN1

Identification & Abstract
User Instructions
I/O
Cost, Contents